CDS

Accession Number TCMCG078C16169
gbkey CDS
Protein Id KAG0477178.1
Location complement(join(43092169..43092293,43096758..43096827,43096913..43097035,43097118..43097225,43097695..43097902,43097985..43098411,43098546..43098751,43098922..43099149,43100071..43101410,43110708..43110857))
Organism Vanilla planifolia
locus_tag HPP92_014019

Protein

Length 994aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000006.1
Definition hypothetical protein HPP92_014019 [Vanilla planifolia]
Locus_tag HPP92_014019

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyltransferase 8 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R05191        [VIEW IN KEGG]
KEGG_rclass RC00005        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01003        [VIEW IN KEGG]
KEGG_ko ko:K13648        [VIEW IN KEGG]
EC 2.4.1.43        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00520        [VIEW IN KEGG]
map00520        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGAGGCGGAGGGCGTTGGAATGGTGGAGATGGACGCCGCTTAGGCTCGTGGATTGGATCTGGTCGATTCTTGGTGTCTTCCTCGTCGCGGTTCTTGTCCTCTTCGTCGTGCAGCACCATCACCTCATCCCTCGCCAGCTGCCAATGCAGGTCAAAGGCACAGAGTTTGAAGCAATTCAAGCTGAGAAGTTGAACTTTACAGAAGAACTGTTGAGTAGCACATCATTTGCCAGGCAATTGGTTGATCAGGTCTCCCTAGCAAAAGCTTACTTAGTTCTTGCCAAGGAGCATGGTAACCTTGATTTTGCTTGGGAGCTTAGCTCACATATTAGAAACTGCCAAATATTGCTTTCTCAGGCGGCAATGAGTGGGAAGCGCATTACATTTGAGGAAGCCCATCCTGTTGTTCTTCAGCTCGCAAAGTCCATCTACAAAGCCCAAGATTACCACTATGATATCAGCACCAGTATTACAACTCTAAAGAAACATGCACAAGCTCTGGAGGAGCGTGCCATTGCCGCAACAGCACAGAGTGCAGCATTTGGCAGATTGGCTGTCAACTCCTTGCCAAAGAATCTCCGGTGTGTGAATGTCAAACTCATAACAGATTGGTTTGAAGACCCTAAACTCAAACAGCGTGCGGAAGAGCTGAAGAACTCCCTCCGGTTGACAGACATCAACCTATACCATTTCTGTCTCTTCTCAGATAATGTTCTGGCGACTTCAGTTGTGGTGAATTCTACCATTGCAAACGTAAAGCATCCACTACAGCTTGTCTTCCATGTGGTTACCAACAGCATCAGTTACAAAGCAATGGCTACCTGGTTCTTGAAGAATGACTTGAAAGGGTGCACAGTTTTGGTGAGAAGCGTCGAGGAGTTGTCCTGGTTGAATGAACCCTTCTCACCAGTGTTTGAACATCTGGCAAGAGCGGGAAAGGGAAGTTGGGATATGGGTTCACCCTCAATACTTGAATACCTGCGATTCTACATCCCAATGCTTCATCCATCTCTGGAGAGGATTGTGATTCTTGATGAAGACATTGTGGTTCAAAAAGATCTGACTCCTCTCTTCTCCCAGAACATGCATGGAAGTGTCATAGCGGCCGTGGAGACTTGCCTCGAGTCGTCCCATCGGCTTTACCATTATGTCAACTTTTCTCATCCTCTTATCAGCTCGACCTTCGATCCCCAGGTCTGTGGCTGGGCATTTGGGCTAAATGTGGTGGACCTTATAGCATGGAGGAAGTCAGATGTCACTGCCAGGTTTCATTACTGGTTGAAGCAGAATGCAGATCAAACCCTATGGAGGGATGGGATTTTGCCAGCAGGTCTCCTGGCATTTTATGGACTAATGGTTCCTCTTGATAGGAGATGGCATGTTCTTGGCTTGGGATACGACATGGAACTTGATGATAGGTTGATAGGAAGTGCAGCCAGCTTACACTTTAATGGCAATATGAAACCATGGCTGAAGTTGGCAATCAGCAGAAGGATGTGGAACCGGAAGAGAAGGCGGGAAGATCCGCCGTCAATCCATCCTCGTAACCGTTACGCTGATGAGCCGCCAGACTTTGGTCTCCTTGCCTCCCTGTACCCTTCCTTCAAGCAATTCGTCTTCTCCTCCCGTTCGGGCCGCCCCGCAATAGACTGGAAAGACTACAATGCCACCCGCGAGCTCACTCGCGTACTCCTCCTCCACGACCATGGCATCAATTGGTGGATTCCTGATGGCCAACTTTGCCCAACGGTGCCAAATCGTTTAAACTACATTCACTGGATTGATGACCTGCTATCTTCTGACCTCATCCCTAAAAGACAGACTTCGAATAACAAAGTCAAAGGCTTTGATATCGGCACTGGGGCTAACTGCATATACCCGCTCCTCGGTGCATCTTTACTTGGTTGGGAGTTTGTTGGCTCAGATGTCACAAAAGTAGCCCTTGAATGGGCTACAAAAAATGTTGAGAGCAACCCTAAGCTATTGGAACTCATCAAGATTAGGGATGCTACTGATCCATTTAGTTGTAGTGATGCTACTCAGAGTACAAGGGAGCTCGTTAGTGAGCTTCCTTCAAAATTGTTTTTTGTAGAGAAGGATGAGTCCCAAGGTCAAGAGCTGAAGGAGTGTGGAACTGTGCAACCGCCTGTACTTGTGGGTGTTGTTAAAGAAGGCGAAACTTTTGACTTTTGTATATGTAACCCTCCATTTTTTGAGAGCATTGAGGAAGCAGGTCTCAACCCGAAGACATCATGTGGTGGAACAACTGAAGAGATGGTTTGCCCTGGTGGAGAAATAACTTTTGTTACACAGATCATCAAGGATAGTGTTGTCCTCAAGTGTTCATTTAGGTGGTTCACAGTAATGATTGGGAGAAAGATTAACTTAAAAAGTCTAATGTCAAAGCTACGTGAAGTTGGAGTGTCTATAGTCAAAACTACGGAGTTTGTCCAGGGTCGTACAGCTCGATGGGGGCTTGCTTGGAGTTTCATGCCACCATGCAAGGACTTCATTTCATCTACTGTAGCTTTGAAAAGCCATTGTTCATTTACACTTGAGGGCCTGAACCGCCAATGTGGTGCATTTCAAGTCTTAAAAGCAGTGGAATCATTTTTCTTAGACAAGGGTGTTCCTTGTAAAATCGACTCTTCATCCTTCTGTATCAATGTAAATTTAAACAATGTGCAAGATAACACGGCAAATGAAATGGGCTTGAGTGATTTGTTAAAAGATGCTGAAAATCACTCCACAAAAGTATCAAATGGATCATCGTGTGCAGCACTTGTTTCGGTATTTGAACAAATTCCTGGTACAATCCTAGTCAGGTGTTCGCCATTTGGAAAAGATGGAACAGTTTCAGGACTATTGTCTTCCCTATTCATCCATTTGGAGGAACACCTCCGAAAGGAGTTCAGCGGCAAGTCCCACGGTTCACTTCATAAACAAGAATCTAAGAAACCATCGCTTGATGAGACATCACATTAG
Protein:  
MRRRALEWWRWTPLRLVDWIWSILGVFLVAVLVLFVVQHHHLIPRQLPMQVKGTEFEAIQAEKLNFTEELLSSTSFARQLVDQVSLAKAYLVLAKEHGNLDFAWELSSHIRNCQILLSQAAMSGKRITFEEAHPVVLQLAKSIYKAQDYHYDISTSITTLKKHAQALEERAIAATAQSAAFGRLAVNSLPKNLRCVNVKLITDWFEDPKLKQRAEELKNSLRLTDINLYHFCLFSDNVLATSVVVNSTIANVKHPLQLVFHVVTNSISYKAMATWFLKNDLKGCTVLVRSVEELSWLNEPFSPVFEHLARAGKGSWDMGSPSILEYLRFYIPMLHPSLERIVILDEDIVVQKDLTPLFSQNMHGSVIAAVETCLESSHRLYHYVNFSHPLISSTFDPQVCGWAFGLNVVDLIAWRKSDVTARFHYWLKQNADQTLWRDGILPAGLLAFYGLMVPLDRRWHVLGLGYDMELDDRLIGSAASLHFNGNMKPWLKLAISRRMWNRKRRREDPPSIHPRNRYADEPPDFGLLASLYPSFKQFVFSSRSGRPAIDWKDYNATRELTRVLLLHDHGINWWIPDGQLCPTVPNRLNYIHWIDDLLSSDLIPKRQTSNNKVKGFDIGTGANCIYPLLGASLLGWEFVGSDVTKVALEWATKNVESNPKLLELIKIRDATDPFSCSDATQSTRELVSELPSKLFFVEKDESQGQELKECGTVQPPVLVGVVKEGETFDFCICNPPFFESIEEAGLNPKTSCGGTTEEMVCPGGEITFVTQIIKDSVVLKCSFRWFTVMIGRKINLKSLMSKLREVGVSIVKTTEFVQGRTARWGLAWSFMPPCKDFISSTVALKSHCSFTLEGLNRQCGAFQVLKAVESFFLDKGVPCKIDSSSFCINVNLNNVQDNTANEMGLSDLLKDAENHSTKVSNGSSCAALVSVFEQIPGTILVRCSPFGKDGTVSGLLSSLFIHLEEHLRKEFSGKSHGSLHKQESKKPSLDETSH